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Abstract 

The most common approach to mitigate the impact that the presence of malicious nodes has on the accuracy of 
decision fusion schemes consists in observing the behavior of the nodes over a time interval T and then removing 
the reports of suspect nodes from the fusion process. By assuming that some a-priori information about the presence 
of malicious nodes and their behavior is available, we show that the information stemming from the suspect nodes 
can be exploited to further improve the decision fusion accuracy. Specifically, we derive the optimum fusion rule 
and analyze the achievable performance for two specific cases. In the first case, the states of the nodes (corrupted or 
honest) are independent of each other and the fusion center knows only the probability that a node is malicious. In 
the second case, the exact number of corrupted nodes is fixed and known to the fusion center. We also investigate 
the optimum corruption strategy for the malicious nodes, showing that always reverting the local decision does not 
necessarily maximize the loss of performance at the fusion center. 

Index Terms 

Adversarial signal processing, byzantine nodes, distributed detection with corrupted reports, data fusion. 
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I. Introduction 

Distributed detection in the presence of corrupted nodes, often referred to as byzantine nodes m, or simply 
Byzantines, has received an increasing attention for its importance in several application scenarios, including wireless 
sensor networks m, 11, cognitive radio a, Q, distributed detection a, multimedia forensics 13 and many others. 

The most commonly studied scenario is the parallel distributed data fusion model. According to such a model, the 
n nodes of a multi-sensor network observe a system over a period of time T through the vectors Xi{t), X 2 {t).. .Xn{t), 
t = 1.. .T. Based on such vectors, the nodes compute n reports and send them to a Fusion Center (FC). The fusion 
center gathers the local reports and makes a final decision about the status of the observed system at each time 
instant t. Hereafter, we assume that the system can be only in two states (0 and 1), additionally we make the 
simplifying assumption that the reports correspond to binary values whereby the local nodes inform the fusion 
center about their knowledge on system status. Specihcally, we indicate hy r^j G {0,1}, z = 1. .. n, j = 1... T the 
report sent by node i at time j. 

Decision fusion must be carried out in an adversarial setting, that is by taking into account the possibility that 
some of the nodes malevolently alter their reports to induce a decision error. This is a recurrent situation in many 
scenarios wherein the nodes may make a proht from a decision error (see H] for a general introduction to this 
topic). More rigorously, since the nodes do not know the exact system status they must estimate it based on the 
observation vectors Xi{t). Let us denote by Uij the binary local decision made by node i regarding the status of 
the system at time j. While for honest nodes Vij = Uij, we assume that malicious nodes flip Uij with a certain 
probability Pmai- In such a case we obviously have Uij ^ r^. Hereafter, we assume that the state of each node 
(honest or byzantine) remains constant over the whole observation period T. In addition, we make the simplifying 
assumption that the flipping probability does not depend on the observed sequence Xi{t). A pictorial representation 
of the setup described above is given in figure [T] If we assume that the FC makes the decision on system status at 
time t by considering only the reports corresponding to j = t, then the optimum way to combine the local decisions 
according to the Bayesian approach has already been determined in ID, Q, and is known as Chair-Varshney rule. 
In the case of symmetric error probabilities at the nodes. Chair-Varshney rule corresponds to simple majority-bases 
decision. Better results can be obtained by assuming that the FC collects the reports received from the nodes over 
a time window T, and decides on all the states assumed by the system at the end of the observation interval. Given 
that node states are constant over T, it is easy to argue that observing the reports from the nodes over many instants 
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Fig. 1; Sketch of the adversarial decision fusion scheme, 
allows the FC to identify byzantine nodes and hence reduce the error probability of the final decision. 

II. Prior work and contribution 

The problem described in the previous section, along with several variants, has already been studied in several 
papers, including 0 , a, m, m. In the great majority of the cases, the fusion center observes the node reports 
over the time window T trying to identify byzantine nodes. Once the byzantine nodes have been identified, the 
fusion center isolates them by removing their reports from the fusion process. The most relevant works for this 
paper are 0, ifTOll . which generalize the analysis carried out in 0. Even in these works the malicious nodes do 
not take advantage of the knowledge of the observed sequences Xi{t), do not cooperate with each other, and do 
not know the true state of the system. In contrast to 0, the analysis is not limited to the asymptotic case. The 
papers propose two methods to identify the byzantine nodes by observing the behavior of the nodes over the time 
window T and assigning them a reputation measure. Then, decision fusion is carried out by discarding the nodes 
with a low reputation. A different isolation scheme based on adaptive learning is described in im, where the 
observed behavior of the nodes over time is compared with the expected behavior of honest nodes. A peculiarity 
of this scheme is that it works even when the majority of the nodes are byzantine, but it requires very large T to 
achieve good performances. Tolerant schemes which mitigate the impact of Byzantines in the decision have also 
been proposed, like in ifT^ . where the reports are weighted differently according to node reliability but no one is 
removed from the network. 

In this paper, we start from the observation that in the presence of some a-priori information about the probability 
distribution of Byzantine nodes and their behaviour, removing the reports received from suspect nodes is not the 
optimum strategy, since such reports may still convey some useful information about the status of the observed 
system. Specifically, we derive the optimum fusion rule, by assuming that the probability distribution of malicious 
nodes across the network and the flipping probability Pmai are known. We analyze the performance of the system 
for two particular setups: according to the first one the states of the nodes (corrupted or honest) are independent 
of each other and the fusion center knows only the probability that a node is malicious. In the second case, the 
exact number of corrupted nodes is known to the fusion center. Having derived the optimum fusion strategy, we 
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turn our attention to the optimum attacking strategy for the Byzantines, in term of flipping probability, showing 
that in some cases always flipping the local decision (Pmai = 1) is not the best choice. 

The rest of this paper is organized in two main sections: in Section we derive the optimum fusion rule, while 


in Section IV we use simulations to analyze the performance of the optimum fusion rule. 


III. Optimum fusion rule 

In the rest of the paper, we will use capital letters to denote random variables and lowercase letters for their 
instantiations. Let S'^ = {Si, S 2 ■ ■ ■ St), with Si G {0,1}, be a sequence of independent and identically distributed 
(i.i.d.) random variables dehning the states assumed by the system. We assume all the states to be equiprobable. 
We indicate by Uij G {0,1} the local decision made by node i about the system status at time j. We exclude any 
interaction between the nodes and assume that Ui/s are conditionally independent for a fixed status of the system. 
This is equivalent to assuming that the local decision errors are i.i.d. 

It is assumed that the state of the nodes in the network is not known to the FC, which however has some prior 
statistical knowledge about the presence of Byzantines. To elaborate, let = {Ai... An) be a binary random 
vector describing the states of the nodes over the network. More specihcally, each random value Ai denotes the 
state of node v. Ai = Q (res. Ai = 1) indicates that node i is honest (res. byzantine). In the scenario considered in 
this paper, the FC is supposed to know the prior probability of A^, i.e. PA^{a^) or simply P{a^). 

Finally, we let R = {Rij}, * = ^ ■ ■n,j = l...rbe a random matrix with all the reports received by the 
fusion center, and r = a specihc instantiation of such a matrix. As stated in the introduction, Rij = Uij for 
honest nodes, while P{Rij ^ Uij) = Pmai for byzantine nodes. According to our model, local decisions Uij are 
flipped independently of each other with equal probabilities, so that the action of malicious nodes can be modeled 
as a number of independent binary symmetric channels with crossover probability Pmai- 

We are now ready to derive the optimum decision rule at the FC. Given the received reports r and by adopting 
a maximum a posteriori probability criterion, the optimum decision rule which minimizes the error probability Pe 
can be written as: 


= argmaxP(s’^lr). (1) 

By applying Bayes rule and exploiting the fact that the all state sequences are equiprobable we get: 

= argmaxP(r|s^). (2) 

Since in our model the states of the nodes are themselves random variables, we can write; 
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s'^’* =argmax^P(r|a”,s^)P(a") 

a'^ 

= argmax^ P(ri|a,, s^)")P(a”) 


an ^ i—l 

n T 


= argmax^ ( H 111“*’)^(a”), 


(3) 

(4) 

(5) 


where indicates the z-th row of r. In Q we exploited the fact that, given the state of the nodes and the states 
of the system over T, the reports sent by the nodes are independent of each other, while (|^ derives from the 
observation that, given the node state, each report depends only on the corresponding status of the system. 

We now consider the case in which the probability of a local decision error, say e, is the same regardless of 
the system status, that is £ = P{Uij ^ Sj\Sj = Sj), Sj = 0,1. For a honest node such a probability corresponds 
to the probability that the report received by the FC does not correspond to the system status. This is not the 
case for byzantine nodes, for which the probability of a wrong report, hereafter indicated by 6, can be written as 
S = £(1 ~ Pmal) + (1 ~ s)Pmal- 

According to the above setting, the nodes can be modeled as binary symmetric channels, whose input corresponds 
to the system status and for which the crossover probability is equal to £ for the honest nodes and 6 for the byzantine 
nodes. With regard to e we can safely assume that such a value is known to the fusion center. The value of <5 
depends on the value of Pmai which is chosen by the Byzantines and then should be optimally determined in a 
game-theoretic framework (see for an example in this sense). While this is an interesting possibility, we leave 
such an analysis to a future work, and assume that Pmai is known to the FC. From the optimum decision rule 
becomes: 


n^q{i) ^T-n^q{i) 


s^'* = argmax^ ( (1 ~ 

(1 _ 


(6) 


P(a"), 


where neq{i) is the number of j’s for which = Sj. 


In Section III-A and III-B we consider two simple, yet insightful, cases of prior knowledge on the distribution 
of the states of the nodes in the network. 


A. Independent node states 

In this section we consider a situation in which the states of the nodes are independent of each other and the 
state of each node is described by a Bernoulli random variable with parameter a, that is P{Ai = 1) = a, Vi In this 
way, the number of byzantine nodes in the network is a random variable with a binomial distribution. In particular, 
we have P(a") = P{ai), and hence Q can be rewritten as: 

sT* = argmax f F’(ri|ai, s'^)P(ai)y (7) 
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The expression in round brackets corresponds to a factorization of P(r, a"|s^). If we look at that expression as a 
function of a”, it is a product of marginal functions. By exploiting the distributivity of the product with respect to 
the sum we can rewrite 0 as follows 




= argmaxJJ ( ^ P( 




( 8 ) 


which can be computed more efficiently, especially for large n. The expression in equation (|^ can also be derived 
directly from (|^ by exploiting first the independence of the reports and later applying the total probability law. 


Since, as noticed before, P{t 


) = nj=i from 0 it is easy to derive the to-be-maximized 


expression for the case of symmetric error probabilities at the nodes, which is 


n 

=argmax]J [(1 - a)(l - 

+a{l - 


(9) 


B. Fixed number of Byzantines 

In this section we consider a scenario in which the exact number of Byzantines in the network is known to 
the FC. Let us indicate such a number with k (to draw a parallelism with the analysis carried out in the previous 
sections, k should be equal to the average number of Byzantines nodes that is k = an). Since the overall number 
of Byzantines is fixed, the state of a node is no longer independent from the states of the other nodes. More 
specifically, we have P(a”) = (^) ^ if the numbers of I’s in a" is k, 0 otherwise, since the only configurations 
with non-zero probability are those with exactly k Byzantines. Let I be the indexing set {l,2,...,n}. We denote 
with Ik the set of all the possible fc-subsets of I. Let I G Ik be a random variable with the index of the Byzantine 
nodes, a node i being byzantine if i G I, honest otherwise. Then, from 0 we derive 

= argmax P(r| J, s^)p(s^), (10) 

gT ( ^ 

/eXfc 

which in the symmetric case corresponds to 

= argmax ^ 

^ /GXfe '^iG/ 

iGl\/ ^ 

We conclude this section observing that implementing equation 0 is more complex than 0, due to the summation 
over all the possible fc-subsets of I. 

IV. Simulation results and discussion 

We run simulations to estimate the error probability at the FC for both the case of independent node states and 
fixed number of nodes. Let Pe,R and Pg.F be the error probabilities for the former and latter case respectively. 
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Setting/Pr„ai 

0.5 

0.6 

0.7 

0.8 

0.9 

1.0 

n = 16, Q = 0.4375, T = 4 

0.0131 

0.0221 

0.0374 

0.0777 

0.1853 

0.3162 

n = 11, Q = 0.4545, T = 9 

0.0217 

0.0278 

0.0302 

0.0444 

0.1320 

0.3708 

n = 10, a = 0.4, T = 10 

0.0176 

0.0211 

0.0200 

0.0311 

0.1003 

0.2663 

n = 5, a = 0.4, T = 15 

0.0814 

0.0951 

0.0919 

0.0869 

0.1640 

0.3189 

TABLE I: Pe,R versus 

Pmai under different settings. 


Setting/Pma! 

0.5 

0.6 

0.7 

0.8 

0.9 

1.0 

n = 16, a = 0.4375, T = 4 

0.0045 

0.0054 

0.0042 

0.0067 

0.0190 

0.0357 

n = 11, a = 0.4545, T = 9 

0.0093 

0.0090 

0.0058 

0.0043 

0.0048 

0.0046 

n = 10, a = 0.4, T = 10 

0.0101 

0.0079 

0.0060 

0.0038 

0.0023 

0.0011 

n = 5, a = 0.4, T = 15 

0.0339 

0.0301 

0.0297 

0.0294 

0.0177 

0.0087 


TABLE II: Pe,F versus Pmai under different settings. 



Fig. 2: vs a (Pmai = 1, e = 0.1, n = 100, T = 4) 



Fig. 3: Pe,R vs Praal (« = 0.4, £ = 0.1, 71 = 10, T = 4). 

The simulations were carried by considering networks with different number of nodes and various values of T in 
order to test the effect of these parameters on the detection performance as well as on the optimum strategy for 
the malicious nodes. 
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Fig. 4: Pf,^F vs Pmai (a = 0.4, e = 0.1, n = 10, T = 4). 

In Figure]^ we simulated a network with n = 100 nodes, with independent states. The simulations were carried 
out by letting Pmai = 1 and a ranging in the interval [0.4,0.49]. Upon inspection of the results, we can see that 
the optimal fusion rule achieves a lower P^ compared to the majority rule and the isolation schemes proposed in 
a and QO). 

In Figure we simulated a smaller network with n = 10 nodes, fixing a = 0.4 and letting Pmai vary in 
[0.5,1.0]. With such a small number of nodes, we were able to also simulate a network with a fixed number of 
nodes (fc = 4 = an), as shown in Figure]^ As a first observation, in both cases the optimum fusion rule outperforms 
the other rules achieving lower Pg. Moreover, the benefit from the optimum fusion rule is more evident when the 
number of Byzantines is fixed (in fact in this case the FC has more information about the distribution of malicious 
nodes over the network). With regard to Pmai, we can see that for the setting used in the figures, the optimum 
attacking strategy for the malicious nodes corresponds to always flipping the result of the local decision. Such a 
result agrees with previous works in Q, ifTOl . 

As a further analysis, we investigated the impact of T on the error probability. The results we obtained are given 
in Table |I| and Table [^ Due to complexity of the optimum decision fusion rule for the case of a fixed number of 
nodes, we had to limit the size of the simulations by letting n + T = 20. By looking at Table |Ij we see that, in the 
case of independent node states, the preferable strategy for the Byzantines is always Pmai = 10. Once again this 
agrees with the analysis carried out in liTOl . lfTSll . This is no more the case when the number of nodes is fixed. Table 
[Hj shows that when the observation windows T increases (T = 9,10,15 in the table), Pmai = 1 is no longer the 
optimal attack, which now corresponds to Pmai = 0.5. The intuition behind this apparently unexpected behavior is 
that when the byzantine nodes can be identified, and since the FC is assumed to know Pmai, the FC can revert the 
action of the Byzantines by simply inverting the reports received from such nodes. This is what the optimal fusion 
rule implicitly does. Interestingly such a behavior becomes evident in the case of fixed number of Byzantines only 
(and for a large enough observation window), since in such a case the a-priori information available to the FC is 
greater hence easing the identification of the malicious nodes. This also explains why in this case the optimum 
strategy for the Byzantines corresponds to Pmai = 0.5. With such a strategy, in fact, the mutual information between 
the system status and the reports is zero, thus minimizing the information the FC can rely on. Even if the above 
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results refer only to a limited number of cases, they lead us to conjecture the existence of a bimodal behavior for 
the optimum corruption strategy: when the FC can not identify reliably the byzantine nodes, try to directly induce 
a decision error by letting Pmai = 1, otherwise it simply minimize the information available to the FC by letting 
Pmal = 0-5. 


V. Conclusions 

We derived the optimum fusion rule for a FC receiving possibly corrupted reports regarding the status of an 
observed system. We considered two different setups corresponding to different a-priori information about the 
distribution of byzantine nodes. We showed the advantage of the optimum fusion strategy with respect to previous 
works through simulations, and identified a bimodal behavior of the optimum flipping probability to be adopted 
by the Byzantines. In future works we will try to remove some of the hypotheses behind our analysis, namely the 
necessity that the FC knows Pmai and the assumption that the byzantine nodes do not exploit the knowledge of 
Xi{t) to devise a more powerful attack. Casting the fusion problem into a game-theoretic setting where the FC and 
the Byzantines pursue opposite goals along the lines highlighted in IH is also an interesting research direction. 
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